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ABSTRACT 


This  paper  deals  with  a Bayes  sequential  sampling  procedure 
for  selecting  the  most  probable  event  from  a multinomial  distri- 
bution whose  parameters  are  distributed  a priori  according  to  a 
Dirichlet  distribution.  The  given  rule  is  compared  with  other 
sampling  rules  which  have  been  considered  in  the  literature. 
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1.  Introduction . Consider  a multinomial  population  with  k > 2 
cells  and  the  associated  probability  vector  p = (p^,...,pk)  where 

p^  = 1.  Let  £ p[2]  — ***  — p(k]  denote  the  ordered 

values  of  p^...^^.  A number  of  statistical  procedures  based 
on  sequential  and  non-sequential  sampling  rules  have  been  con- 
sidered in  the  literature  for  selecting  the  most  probable  event, 
that  is,  the  event  associated  with  p^j.  The  papers  of  Bechhofer 
et  al.  (1959),  Cacoullos  and  Sobel  (1966),  Alam  (1971)  and  Ramey 
and  Alam  (1979)  may  be  cited  for  reference.  Gupta  and  Nagel  (1967) 
have  considered  a procedure  for  selecting  a subset  of  the  k events 
which  includes  the  most  probable  event.  In  line  with  the  clas- 
sical approach  of  ranking  and  selection  methods,  the  central 
theme  of  these  papers  is  to  find  the  least  favorable  configuration, 
that  is,  the  value  of  p for  which  the  probability  of  a correct 
selection  is  minimized,  and  to  determine  the  sample  size  for 
the  fixed  sample  procedure  and  the  expected  sample  size  for 
the  sequential  sampling  rule,  for  which  the  minimum  probability 
of  a correct  selection  is  equal  to  a given  number  p* . In  this 
paper  we  consider  the  selection  problem  from  a Bayesian 
approach . 

The  multinomial  probability  function  is  given  by 


n * i x 

(1*1)  ^ (^^ , • • • ,x^)  3 (x^,...,Xj^)  p ^ ... 

where  xi  = n*  Let  D(v^,...,v^)  denote  the  Dirichlet  dis- 

tribution, given  by  the  density  function 


(2) 


(1.2) 
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Prom  (1.1)  and  (1.2)  it  is  seen  that  the  Dirichlet  distribution 
is  a conjugate  prior  distribution  for  p.  We  consider  a Bayes 
sequential  sampling  procedure  for  selecting  the  most  probable 
event,  assuming  a Dirichlet  prior  distribution  for  p and  the 
loss  function  being  given  as  follows : Let  c > 0 denote  the 


cost  of  sampling  per  unit  observation  and  let  d^  denote  the 


decision  to  select  the  event  associated  with  the  ith  cell  as 
the  most  probable  event,  after  the  sampling  is  stopped.  If  n 
observations  have  been  taken  the  loss  is  given  by 


(1.3) 


L(di,p)  = nc  + (pjkj-pi) 


The  theory  of  the  Bayes  sequential  sampling  procedure  is  given 
in  Section  2 and  its  application  is  shown  in  Section  3.  In 
Section  4 we  compare  its  risk,  that  is  the  expected  loss,  with 
the  risk  of  the  fixed  sample  procedure  of  Bechhofer  and  the 
sequential  sampling  rule  of  Cacoullos  and  Sobel. 


2.  Bayes  sequential  sampling  rule.  Suppose  that  p is  distri- 
buted a priori  according  to  the  Dirichlet  distribution 
D (v^, . . . , v^) • Then  the  posterior  distribution  of  p given  the 
observation  x = (x^,...,x^)  is  Dtv^+x^ , . . . , v^+x^) . Marginally, 
p^  is  distributed  according  to  D(v^,  v-v^)  and  its  mean  is 
given  by 


1 ■ — 
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(2.1) 


E Pi  - v./v  . 


After  the  sampling  is  stopped,  the  terminal  decision  should  be 
a Bayes  rule  with  respect  to  the  posterior  distribution  of  p. 
It  follows  from  (1.3)  and  (2.1)  that  the  terminal  decision  is 


di  if 


(2.2) 


^ - max  (Xj^+Vj^,  . . . ,xk+vJt) 


From  (1.3)  it  is  seen  that  n < 1/c  since  the  loss  from  an 
immediate  decision  without  taking  an  observation  is  < 1.  There- 
fore, we  need  consider  only  bounded  sequential  procedures.  Let 
D*  - D(v^,...,vk)  and  let  pj(D*)  denote  the  Bayes  risk  due  to  an 
optimal  sequential  sampling  rule  in  which  not  more  than  j obser- 
vations should  be  taken.  From  (1.3)  and  (2.1)  it  is  seen  that 


(2.3) 


C»o(D*)  * E p{kj  - max  ( v ^ , . . . , v*k ) / v . 


The  marginal  distribution  of  a single  observation  y * (y.,...,y.  ) 

' 1 K 

is  given  by 


Plyi  - 1.  yj  - 0,  ) f i!  - -ji,  i - I k. 


Let  D*(y)  denote  the  posterior  distribution,  given  y.  From 
backward  induction  (see  e.g.  Degroot  (1970),  §12.5),  a recur- 
sive formula  for  the  Bayes  risk  is  given  by 


H 


i 
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(2.4)  Pj+1(D*)  « min  {pQ(D*),  E p ^ (D* (y)  + c} 


min  (p0(D*),  ^ pj(Di*)  + c} 


j =•  0,1,...  ,m-l 


where  m denotes  the  largest  integer  < 1/c  and  D.* 


D(v1# »vi-i»vi+l,vi+i,... ,vk) • Putting  j » 0 in  (2.4)  we  get 


(2.5)  px(D*)  * min  {pQ(D*),  E pQ(D*(y))  + c} 
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The  recursive  relation  (2.4)  gives  the  stopping  rule  for 
the  Bayes  sequential  sampling  procedure,  as  follows:  Take  no 


observation  if  Pg(D*)  <_  °m^D*^'  Otherwise,  take  observations 


sequentially.  Stop  sampling  after  taking  n observations  if 
P0(DMx))  < pm_n(D*(x)  ) . 


3-  Application . Consider  the  special  case  in  which 


V * vo • say*  Let  Dg  = D(Vq,...,Vq)  and  let  pg  denote 
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the  expected  value  of 
bution  Dq.  From  (2.3)  and 


with  respect  to  the  prior  distri- 
(2.5)  we  get 


VDo’ 
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Table  I below  shows  the  continuation  set,  that  is,  the  set  of 
sample  points  or  the  cell  counts  where  the  sampling  should  be 
continued,  for  * 1,  c • .01  and  k * 2(1)5.  The  complement 
of  the  continuation  set  represents  the  set  of  sample  points 
where  the  sampling  should  be  stopped.  The  value  of  n in  the 
table  represents  the  number  of  observations  or  the  stage  of 
sampling.  For  example,  let  k * 2 and  n * 10.  The  table  shows 
that  the  sample  point  (5.5)  lies  in  the  continuation  set. 
Therefore,  if  10  observations  have  been  taken  and  the  cell 
counts  are  (5,5)  then  at  least  one  more  observation  should  be 
taken.  On  the  other  hand,  if  the  cell  counts  are  (6,4),  then 
the  sampling  is  stopped  and  the  cell  associated  with  the  count 
number  6 should  be  selected  as  the  most  probable  event. 

4.  Comparison  of  three  sampling  rules.  The  sampling  scheme 
of  Cacollous  and  Sobel  (1966)  in  which  observations  are  taken 
sequentially  until  the  largest  count  in  any  cell  is  equal  to 
a given  positive  integer  N,  has  been  called  inverse  sampling 
(IS) . It  would  be  interesting  to  compare  the  Bayes  sequential 


(6) 


sampling  rule  given  above  with  the  IS  and  the  fixed  sample  (FS) 
rule  of  Bechhofer  (1959) . Suppose  that  p is  distributed  a 
priori  according  to  the  Dirichlet  distribution  DQ.  Since  the 
terminal  decision  for  both  the  IS  and  FS  rules  is  to  select  the 
event  associated  with  the  largest  cell  count,  it  is  Bayes  with 
respect  to  D^. 

The  value  of  pm(DQ) » the  risk  of  the  Bayes  sequential 
sampling  rule  is  obtained  from  (2.4).  Formulas  for  the  risk 
of  FS  and  IS  being  denoted  by  p(FS)  and  p(IS),  respectively, 
are  given  in  the  Appendix.  Let  p*(FS)  denote  the  minimum 
value  of  p(FS)  minimized  for  all  values  of  n,  the  fixed  sample 
size.  Similarly,  let  p*(I S)  denote  the  minimum  value  of  p(IS), 
minimized  for  all  values  of  N.  Table  II  below  gives  values 
of  p(FS)  and  p(IS)  for  certain  values  of  vQ,  c and  k,  where 


p* (FS)  - Pm(Dn) 

P(FS)  * jV(Fsp — — * 100,  p(IS) 


o* (IS)  - p (Dn) 
m 0 

p* (IS) 


\ 


100. 


The  values  of  p(FS)  and  p(IS)  represent  the  percentage  reduction 
in  the  risk  due  to  the  optimal  Bayes  sequential  sampling  rule, 
compared  with  the  FS  and  IS  sampling  rules,  respectively.  It 
is  seen  from  the  table  that  p(FS)  > p(IS)  for  any  given  value 
of  (c,  vQ,  k) , and  that  p(FS)  and  p(IS)  are  both  decreasing  in 
c for  any  given  value  of  (v^,  k) . 


(7) 


APPENDIX 


5.  Values  of  pm(D0) , P (FS)  and  P(IS).  From  (1.1)  and  (1.2)  we 
obtain  the  marginal  distribution  of  x,  given  by  the  probability 
function 


(5.1) 


h (x) 


r (kvo) 


r (kv0+n)  (r  (v0) ) 


(x 


1' 


•V 


"i-1 r <v0+xi’ 


The  risk  of  FS  is  given  by 

(5.2)  P (FS)  « P0  ♦ nc  - (kv0-*-n)"1(v0  + , n 

max  (x1 , . . . ,xk) h (x) ) 

where  n denotes  the  fixed  sample  size  and  pQ  * E p^,  the  ex- 
pected  value  of  with  respect  to  the  prior  distribution  . 

Similarly,  the  risk  of  IS  is  given  by 


(5.3)  p(is)  - p0  - Ix,+...« 


. . +x^  = n+1 
X[k-1]  < x(k]  = 


N 


(kv0+n) 


-1 


V^_ 

kvQ+n+l 


- (n+1) c) h (x* ) 


where  x^j  denotes  the  ith  smallest  amongst  x^,...^^,  and 

! x^_  if  x^  f N 

V * | 

l Xj-1  if  X.  » N . 

The  value  of  Pm(DQ)  is  obtained  from  (2.4). 


r 
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It  is  seen  that  the  formulas  for  the  three  risk  functions, 
given  above,  involve  the  computation  of  E P^j*  the  expected 
value  of  pk  with  respect  to  a Dirichlet  prior  distribution. 

The  value  of  E is  derived  as  follows.  Let  y^,...,yk  be 

k independent  random  variables  and  let  y^  be  distributed 
according  to  the  gamma  distribution  with  degrees  of 
freedom,  i = l,...,k.  Let 


zi  = yi/(^iii  yi>- 

(z^,...,z^)  is  distributed  according  to  the  Dirichlet 


Then  z 


distribution  D(v^, 


,vk) 


From  the  scale  invariant  property 

k 


of  the  statistic  max  (z^,...,*^)  it  follows  that  yi  and 

max  (z^,...,z.)  are  independently  distributed.  Hence 


(5.3) 


E P[k]  = E max  (z^  . . . ,zk) 
E max  (y1#. . . ,yk) 


E 


i-1  yi 


Thus,  kvQpQ  represents  the  expected  value  of  the  largest  order 
statistic  in  a samp’e  size  k from  the  gamma  distribution  . 
The  moments  of  order  statistics  from  the  gamma  distribution 
have  been  tabulated  by  Harter  (1969)  . 
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TABLE  I - Continuation  set  of  sample  points  for  = 1,  c = .01 


n 

m 

k = 3 

k = 

4 

k 

= 5 

1 

(1,0) 

(1,0,0) 

(1,0, 0,0) 

(1,0, 0,0,0) 

2 

(1,1) 

(1,1.0) 

(1, 1,0,0) 

(1,1, 0,0,0) 

3 

(2,1) 

(1,1,1),  (2,1,0) 

(1,1, 1,0) , 

(2, 1,0,0) 

(1,1, 1,0,0) , 

(2, 1,0, 0,0) 

4 

(2,2) 

(2,1,1),  (2,2,0) 

(1,1, 1,1) , 
(2, 2, 0,0) 

(2, 1,1,0) 

(1,1, 1,1,0)  , 

(2, 1,1, 0,0)  , (2,2,0,0,0) 

5 

(3,2) 

(2,2,1),  (3,2,0) 

(2, 1,1,1)  , 
(3, 2, 0,0) 

(2, 2, 1,0) 

(1,1, 1,1,1) , 

(2, 1,1, 1,0),  (2, 2, 1,0,0) 

6 

(3,3) 

(2,2,2),  (3,2,1) 
(3,3,0) 

(2, 2, 1,1)  , 
(3, 3, 0,0) 

(2, 2, 2,0) 

(2, 1,1, 1,1)  , 

(2, 2, 1,1,0),  (2,2,2,0,01 

7 

(4,3) 

(3,2,2),  (3,3,1) 
(4,3,0) 

(2, 2, 2,1) , 
(3, 3, 1,0) 

(3, 2, 2,0) 

(2, 2, 1,1,1)  , 

(2, 2, 2, 1,0),  (3, 2, 2, 0,0) 

8 

(4,4) 

(3,3,2),  (4,4,0) 

(2, 2, 2, 2) , 
(3, 3, 1,1) , 

(3, 2, 2,1) 
(3, 3, 2,0) 

(2, 2, 2, 1,1)  , 

(2, 2, 2, 2,0),  (3, 3, 2, 0,0) 

9 

(5,4) 

(3,3,3),  (4,4,1) 
(5,4,0) 

(3, 2, 2, 2), 
(3, 3, 3,0) 

(3, 3, 2,1) 

(2, 2, 2, 2,1)  , 
(3, 3, 3, 0,0) 

(3, 2, 2, 2,0),  (3, 3, 2, 1,0) 

10 

(5,5) 

(4,3,3),  (4,4,2) 
(5,5,0) 

(3, 3,2,2)  , 
(4, 3, 3,0) 

(3, 3, 3,1) 

(2, 2, 2, 2, 2)  , 
(3, 3, 3, 1,0) 

(3, 3, 2, 1,1),  (3, 3, 2, 2,0) 

11 

(6,5) 

(4,4,3),  (5,5,1) 

(3, 3, 3, 2) , 

(4, 4, 3,0) 

(3, 2, 2, 2, 2)  , 
(3, 3, 3, 2,0) 

(3, 3, 2, 2,1),  (3, 3, 3, 1,1) 

12 

(6,6) 

(4,4,4),  (5,5,2) 

(3, 3, 3, 3) , 
(4, 4, 4,0) 

(4, 4, 3,1) 

(3, 3, 2, 2, 2), 

(3, 3, 3, 2,1),  (3, 3, 3, 3,0) 

13 

(5,4,4),  (5,5,3) 

(4, 3, 3, 3)  , 
(4, 4. 4,1) 

(4,4, 3,2) 

(3, 3, 3, 2, 2)  , 

(3, 3, 3, 3,1) 

14 

(5,5,4) 

(4, 4, 3, 3)  , 

(4, 4, 4, 2) 

(3,3,3,3,21 

15 

(5,5,5) 

(4, 4, 4, 3) 

(3,3, 3, 3, 3) 

16 

<J> 

(4, 4, 4, 4) 

4> 

17 

$ 

<p  - denotes  the  null  set 


i 

t 


.5 

.01 

.005 

.001 

19.9 

25.4 

39.2 

9.8 

12.5 

25.8 

p(FS) 

21.6  24.4  33.8 

18.4 

19.8  28.2 

13.3 

16.0  23.2 

1 p(IS) 

6.7  9.2  19.8 

4.7 

6.8  16.0 

2.7 

5.1  12.7 

19.9 

21.7 

30.2 

15.7 

CO 

h* 

27.0 

11.5 

14.2 

20.0 

4.7 

7.0 

16.4 

3.3 

5.1 

12.9 

0.7 

3.7 

9.9 

18.1 

20.4 

24.8 

13.4 

16.7 

22.0 

8.0 

13.  3 

17.5 

3.6 

5.7 

11.2 

2.0 

5.2 

10.5 

.0 

1.8 

7.8 

.0  2 

.5  1 
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